Data Mining: An experimental approach with WEKA on UCI Dataset

نویسندگان

  • Ajay Kumar
  • Indranath Chatterjee
  • Gregory Piatetsky-Shapiro
  • Micheline Kamber
  • Jian Pei
  • David Landgrebe
  • Dan Geiger
چکیده

Data mining became a popular research field these days. The reasons that attracted attention in information technology, the discovery of meaningful information from large collections of data. Data mining is the perception that we are data rich but very much information poor. Large amount of data is available all around but we can hardly able to turn them in to useful information. The comparative analysis of available classification and clustering algorithms is provided in this paper through theoretical and practical approach with WEKA tool. It also includes the future directions for researchers in the field of data mining.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Execution of APRIORI Algorithm of Data Mining Directed Towards Tumultuous Crimes Concerning Women

Apriori Algorithm is the most popular and useful algorithm of Association Rule Mining of Data Mining. As Association rule of data mining is used in all real life applications of business and industry. Objective of taking Apriori is to find frequent itemsets and to uncover the hidden information. This paper elaborates upon the use of association rule mining in extracting patterns that occur freq...

متن کامل

Data Mining with Weka Heart Disease Dataset

The dataset used in this exercise is the heart disease dataset available in heart-c.arff obtained from the UCI repository. This dataset describes risk factors for heart disease. The attribute num represents the (binary) class attribute: class <50 means no disease; class >50_1 indicates increased level of heart disease. The main aim of this exercise is to predict heart disease from the other att...

متن کامل

Diagnosis and Prognosis Breast Cancer Using Classification Rules

Breast Cancer is highly heterogeneous disease. Breast Cancer Diagnosis and Prognosis are two medical challenges to the researchers in the field of clinical research. Breast self-exam and mammography can help find early diagnosis of breast cancer. This is possible when in some situation or stage the treatment is possible. Treatment may consist of radiation, lumpectomy, and mastectomy and hormone...

متن کامل

Mining Big Data: Breast Cancer Prediction using DT - SVM Hybrid Model

Breast Cancer is becoming a leading cause of death among women in the whole world; meanwhile, it is confirmed that the early detection and accurate diagnosis of this disease can ensure a long survival of the patients. This paper work presents a disease status prediction employing a hybrid methodology to forecast the changes and its consequence that is crucial for lethal infections. To alarm the...

متن کامل

Comparative Analysis of Data Reduction Model for Diabetes

Mining of the Data now a day plays a major role and concern in the present world in the industry and also in the research areas. -Data mining is the process of extracting hidden information from a large set of database and it can help researchers gain both novel and deep insights of unprecedented understanding of large biomedical datasets. Data mining can uncover new biomedical and healthcare k...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016